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We transform a Muller game with n vertices into a safety game with (n!) 3 vertices whose solution 
allows to determine the winning regions of the Muller game and to compute a finite-state winning 
strategy for one player. This yields a novel antichain-based memory structure and a natural notion 
of permissive strategies for Muller games. Moreover, we generalize our construction by presenting a 
new type of game reduction from infinite games to safety games and show its applicability to several 
other winning conditions. 



1 Introduction 

Muller games are a source of interesting and challenging questions in the theory of infinite games. They 
are expressive enough to describe all ou-regular properties. Also, all winning conditions that depend 
only on the set of vertices visited infinitely often can trivially be reduced to Muller games. Hence, they 
subsume Biichi, co-Biichi, parity, Rabin, and Streett conditions. Furthermore, Muller games are not 
positionally determined, i.e., both players need memory to implement their winning strategies. In this 
work, we consider three aspects of Muller games: solution algorithms, memory structures, and quality 
measures for strategies. 

To date, there are two main approaches to solve Muller games: direct algorithms and reductions. 
Examples for the first approach are Zielonka's recursive polynomial space algorithm l22l . which is based 
on earlier work by McNaughton lfl7l . and Horn's polynomial time algorithm for explicit Muller games 
lfT2l . The second approach is to reduce a Muller game to a parity game using Zielonka trees Q or latest 
appearance records (LAR) [TP . 

In general, the number of memory states needed to win a Muller game is prohibitively large Q. 
Hence, a natural task is to reduce this number (if possible) and to find new memory structures which may 
implement small winning strategies in subclasses of Muller games. 

As for the third aspect, to the best of our knowledge there is no previous work on quality measures 
for strategies in Muller games. This is in contrast to other winning conditions. Recently, much attention 
is being paid to not just synthesize some winning strategy, but to find an optimal one according to a 
certain quality measure, e.g., waiting times in request-response games [13] and their extensions ||23| . 
permissiveness in parity games [1,3], and the use of weighted automata in quantitative synthesis (2l[5l. 
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Inspired by work of McNaughton |fT8l , we present a framework to deal with all three issues. Our 
main contributions are a novel algorithm and a novel type of memory structure for Muller games. We 
also obtain a natural quality measure for strategies in Muller games and are able to extend the definition 
of permissiveness to Muller games. 

While investigating the interest of Muller games for "casual living-room recreation" [18], Mc- 
Naughton introduced scoring functions which describe the progress a player is making towards winning 
a play: consider a Muller game (&/, where srf is the arena and (jFq, &\ ) is a partition of the set 

of loops in srf used to determine the winner. Then, the score of a set F of vertices measures how often 
F has been visited completely since the last visit of a vertex not in F. Player i wins a play if and only if 
there is an F £ J^- such that the score of F tends to infinity while being reset only finitely often (a reset 
occurs whenever a vertex outside F is visited). 

McNaughton proved the existence of strategies for the winning player that bound her opponent's 
scores by |^/|! [fl~8 ], provided the play starts in her winning region. The characterization above implies 
that such a strategy is necessarily winning. The bound \&/\ ! was subsequently improved to 2 (and shown 
to be tight) 0. Since some score eventually reaches value 3, the winning regions of a Muller game can 
be determined by solving the reachability game in which a player wins if she is the first to reach a score 
of 3 0. However, it is cumbersome to obtain a winning strategy for the infinite-duration Muller game 
from a winning strategy for the finite-duration reachability game. The reason is that one has to carefully 
concatenate finite plays of the reachability game to an infinite play of the Muller game: reaching a score 
of 3 infinitely often does not prevent the opponent from visiting other vertices infinitely often. 

The ability to bound the losing player's scores can be seen as a safety condition as well. This allows 
us to devise an algorithm to solve Muller games that computes both winning regions and a winning 
strategy for one player. However, we do not obtain a winning strategy for the other player. In general, it 
is impossible to reduce a Muller game to a safety game whose solution yields winning strategies for both 
players, since safety conditions are on a lower level of the Borel hierarchy than Muller conditions. 

Given a Muller game, we construct a safety game in which the scores of Player 1 are tracked (up 
to score 3). Player wins the safety game, if she can prevent Player 1 from ever reaching a score of 3. 
This allows to compute the winning region of the Muller game by solving a safety game. Furthermore, 
by exploiting the intrinsic structure of the safety game's arena we present an antichain-based memory 
structure for Muller games. Unlike the memory structures induced by Zielonka trees, which disregard 
the structure of the arena, and the ones induced by LARs, which disregard the structure of the winning 
condition (J^o,^i)> our memory structure takes both directly into account: a simple arena or a simple 
winning condition should directly lead to a small memory. The other two structures only take one source 
of simplicity into account, the other one can only be exploited when the game is solved. Furthermore, our 
memory also implements the most general non-deterministic winning strategy among those that prevent 
the opponent from reaching a certain score in a Muller game. Thus, our framework allows to extend the 
notion of permissiveness from positionally determined games to games that require memory. 

Our idea of turning a Muller game into a safety game can be generalized to other types of winning 
conditions as well. We define a weaker notion of reduction from infinite games to safety games which 
not only subsumes our construction but generalizes several constructions found in the literature. Based 
on work on small progress measures for parity games [14J, Bernet, Janin, and Walukiewicz showed how 
to determine the winning regions in a parity game and a winning strategy for one player by reducing it 
to a safety game flj. Furthermore, Schewe and Finkbeiner l20l as well as Filiot, Jin, and Raskin iPTOl 

1 This reachability game was the object of study in McNaughton's investigation of humanly playable games, which, for 
practical reasons, should end after a bounded number of steps. 
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used a translation from co-Buchi games to safety games in their work on bounded synthesis and LTL 
realizability, respectively. We present further examples and show that our reduction allows to determine 
the winning region and a winning strategy for one player by solving a safety game. Thus, all these games 
can be solved by a new type of reduction and an algorithm for safety games. Our approach simplifies the 
winning condition of the game - even down the Borel hierarchy. However, this is offset by an increase 
in the size of the arena. Nevertheless, in the case of Muller games, our arena is only cubically larger 
than the arena constructed in the reduction to parity games. Furthermore, a safety game can be solved in 
linear time, while the question whether there is a polynomial time algorithm for parity games is open. 

2 Definitions 

The power set of a set S is denoted by 2 s and N denotes the set of non-negative integers. The prefix 
relation on words is denoted by C. For p G V m and L C V m we define Pref(p) = {w e V* |wCp} and 
Pref(L) = U peL Pref(p). For w = w\ ■ --Wn, let Last(vv) = w n . 

An arena = (V, Vo,V\,E) consists of a finite, directed graph (V,E) without terminal vertices, 
Vo C V and V\ = V \ Vq, where V; denotes the positions of Player i. We require every vertex to have an 
outgoing edge to avoid the nuisance of dealing with finite plays. The size of stf is the cardinality of 
V . A loop C C V in is a strongly connected subset of V, i.e., for every v, V G C there is a path from v to 
v' that only visits vertices in C. A play in starting in v G V is an infinite sequence p = P0P1P2 • • • such 
that po = v and (p„,p„+i) G E for all n G N. The occurrence set Occ(p) and infinity set Inf(p) of p are 
given by Occ(p) = {v G V \ 3n G N such that p„ = v} and Inf(p) = {v G V \ 3 m n G N such that p„ = v}. 
We also use the occurrence set of a finite play infix w, which is defined in the same way. The infinity set 
of a play is always a loop in the arena. A game W = Win) consists of an arena &f and a set Win C V m 
of winning plays for Player 0. The set of winning plays for Player 1 is V m \ Win. 

A strategy for Player i is a mapping a: V*Vi — > V such that (v, a(wv)) G E for all wv G V*V,-. We 
say that a is positional if a(wv) = a(v) for every wv G V*Vi. A play P0P1P2 • • • is consistent with a if 
Pn+i =G(Po"'Pn) f° r every n with p„ G Vj. For v G V and a strategy o for one of the players, we define 
the behavior of (7 from v by Beh(v, a) = {p G V m \ p play starting in v that is consistent with a}. A 
strategy a for Player i is a winning strategy from a set of vertices W C V if every p G Beh(v, a) for v G W 
is won by Player i. The winning region W, (^) of Player i in contains all vertices from which Player i 
has a winning strategy. We always have Wq PI Wi (5f ) = and <S is determined if Wo U W\ (W) = V. 
A winning strategy for Player i is uniform, if it is winning from all v G Wi(9?). 

A memory structure OK = (M,Init,Upd) for an arena (V,Vq,Vi,E) consists of a finite set M of 
memory states, an initialization function Init : V — > M , and an update function Upd : M xV — > M. 
The update function can be extended to Upd* : V + — > M in the usual way: Upd*(p ) = Init(p ) and 
Upd* (p . . . p,,p n + 1 ) = Upd (Upd* (po . . . p n ) , p n + 1 ) ■ A next-move function (for Player /) Nxt : Vj x M -> V 
has to satisfy (v,Nxt(v,m)) G E for all v G Vi and all m G M. It induces a strategy a for Player i with 
memory Wl via a(po . . .p«) = Nxt(p„,Upd*(po . ..p n )). A strategy is called finite-state if it can be im- 
plemented with a memory structure. The size of SOT (and, slightly abusive, a) is \M\. 

We consider two types of games defined by specifying Win implicitly. A safety game is a tuple 
<g = (£?,F) with F C V and Win = {p eV w \ Occ(p) CF}. A Muller game is a tuple §f = (^,#0,^1) 
where J^o is a set of loops of sf, J^i contains the loops which are not in J^b> and Win = {p G | 
Inf(p) G J^b}> i-e., p is winning for Player i if and only if Inf(p) G Safety games are determined 
with uniform positional strategies and Muller games are determined with uniform finite-state strategies 
of size \si\ \ CD. 
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3 Scoring Functions for Muller Games 

In this section, we introduce scores and accumulators for Muller games. These concepts describe the 
progress of a player throughout a play. Intuitively, for each set F C V, the score of F of a play prefix w 
measures how often F has been visited completely since the last visit of a vertex that is not in F or since 
the beginning of w. The accumulator of the set F measures the progress made towards the next score 
increase: Acc/r(w) contains the vertices of F seen since the last increase of the score of F or the last visit 
of a vertex v ^ F, depending on which occurred later. For a more detailed treatment we refer to ll9l[T8l. 

Definition 1. Let w G V + , v G V, and F C V. 

• Define Sc/ V i(v) = 1 and Acc{ v \ (v) = 0, and for F ^ {v} define Scf-(v) = and Accf (v) = Fn{v}. 

• Ifv G" F, then Sc^- (wv) = and Accp (wv) = 0. 

• Ifv G F and AcCf(vv) = F \ {v}, then Sep (wv) = Sep (w) + 1 and AcCf(wv) = 0. 

• Ifv G T 7 and Accf(w) / F \ {v}, ?/z<?« Sc^ (wv) = Scf(w) and Acc/r(wv) = Accf-(w) U {v}. 

Also, for J"C2 V de/wie MaxScj? : V + L>V m -> NU {oo} fry MaxScjr(p) = max Fe jrmax wEp Sc F (w). 

Example 1. LetV = {0, 1,2} andF = {0,1}. W? We Sc f (10012100) = 1 anrf Acc F (10012100) = {0}, 
MaxSc| f j(10012100) = 2, owe ?o ?/ze prefix 1001. 77ie score for F is reset to fry f/je occurrence of 
2, i.e., Sc F (10012) = and Acc F (10012) =0. 

If w is a play prefix with Scp(w) > 2, then F is a loop of the arena. In an infinite play p, Inf(p) 
is the unique set F such that Sc F tends to infinity while being reset to only finitely often. Hence, 
MaxScjr, ( (p) < oo implies Inf(p) G Also, we always have AcCf (w) C F. 

Next, we give a score-based preorder and an induced equivalence relation on play prefixes. 

Definition 2. Let & C 2 V and w,w' G V + . 

1. w is ^-smaller than w', denoted by w < w', if Last(w) = Last(w') and for all F G 

• Scf(w) < Scf(w'), or 

• Sep (w) = Scf {W) and Accp (w) CAccf(w'). 

2. w and" w' are ^-equivalent, denoted by w =jr w', ifw < jr w' and W < w. 

The condition w =jf w' is equivalent to Last(w) = Last(w') and for every F G & the equalities 
Scf(w) = Scf (w') and Acc^ (w) = Accf (w ; ) hold. Thus, =jr is an equivalence relation. Both <jr and 
=jr are preserved under concatenation, i.e., = & is a congruence. 

Lemma 1. Le? J'C2 1 ' and w,w' G V + . 

2. 7/w < w', f/jen ww <j? w'ufor all u G V*. 

2. 7/w = w', then wu =jr w'ufor all u G V*. 

4 Solving Muller Games by Solving Safety Games 

In this section, we show how to solve a Muller game by solving a safety game. Our approach is based 
on the existence of winning strategies for Muller games that bound the losing player's scores by 2. 

Lemma 2 ([9]). In every Muller game = (£/,JPo,J^\) Player i has a winning strategy O from Wj(&) 
such that MaxSc jr,_,-(p) — 2 for every play p G Beh(v, a) with v G Wi(W). 
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The following example shows that the bound 2 is tight. 
Example 2. Consider the Muller game 5f = (^,«^o>«^i)> where stf is depicted in Figure\J\ JFq = 
{{0}, {2}, {0, 1,2}} and &\ = {{0, 1}, {1,2}}. By alternatingly moving from 1 to and to 2, Player 
wins from every vertex, i.e., we have Wq(<£) = {0, 1,2}, and she bounds Player 1 's scores by 2. However, 
he is able to achieve a score of two: consider a play starting at 1 and suppose (w.l.o.g.) that Player 
moves to vertex 0. Then, Player 1 uses the self-loop once before moving back to 1, thereby reaching a 
score of 2 for the loop {0, 1} G &\. 



A simple consequence of Lemma [2] is that a vertex v is in Player 0's winning region of the Muller 
game 5f if and only if she can prevent her opponent from ever reaching a score of 3 for a set in &\ . 
This is a safety condition which only talks about small scores of one player. To determine the winner 
of , we construct an arena which keeps track of the scores of Player 1 up to threshold 3. The winning 
condition F of the safety game requires Player to prevent a score of 3 for her opponent. 
Theorem 1. Let £f be a Muller game with vertex set V. One can effectively construct a safety game &s 
with vertex set V s and a mapping f: V — > V s with the following properties: 

1. For every v G V: v G W/(S?) if and only if f(v) G Wi(%). 

2. Player has a finite-state winning strategy from Wq(W) with memory M C Wb(^s). 



Note that the first statement speaks about both players while the second one only speaks about 
Player 0. This is due to the fact that the safety game keeps track of Player 1 's scores only, which al- 
lows Player to prove that she can prevent him from reaching a score of 3. But as soon as a score of 3 is 
reached, the play is stopped. To obtain a winning strategy for Player 1 , we have to swap the roles of the 
players and construct a safety game which keeps track of the scores of Player 0. Alternatively, we could 
construct the arena which keeps track of both player's scores. However, that would require to define two 
safety games in this arena: one in which Player has to avoid a score of 3 for Player 1 and vice versa. 
This arena is larger than the ones in which only the scores of one player are tracked (but still smaller 
than (|V | !) 3 ). It is well-known that it is impossible to reduce a Muller game to a single safety game and 
thereby obtain winning strategies for both players. We come back to this in Section [6] 

We begin the proof of Theorem Q] by defining the safety game 9?$- L et ^ = ,^o,^\) with arena 
gf = (V,V ,V h E). We define 

Plays <3 = {w j w play prefix in W and MaxScjr, (w) < 3} 

to be the set of play prefixes in 'S in which the scores of Player 1 are at most 2 and we define 

Plays =3 = {wo • • • w n+ \ \wq ■ ■ • w n +\ play prefix in , MaxSc,^ (wq ■ ■ ■ w n ) < 2, and 



to be the set of play prefixes in which Player 1 just reached a score of 3. Furthermore, let Plays <3 = 
Plays <3 UPlays =3 . Note that these definitions ignore the scores of Player 0. The arena of the safety game 
we are about to define is the = -quotient of the unraveling of srf up to the positions where Player 1 
reaches a score of 3 for the first time (if he does at all). 




Figure 1: The arena j# for Example |2] 




MaxScjr, (w • • • w„w n+ i) = 3} 
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Figure 2: The safety game ^ for <S from Example |2] (vertices drawn with double lines are in F); the 
dashed line separates the winning regions. 



Formally, we define % = ((V S ,V£,V?,E S ),F) where 
. yS = Plays< 3 / = ^, 

• V? = {[w] =n | [w]= fi G V s and Last(w) G V,} for i G {0, 1}, 

• ([w] =J?i , [wv] =3r ) G E s for all w G Plays <3 and all v with (Last(w),v) G Z?H, and 

• F = Plays <3 / =JV 

The definitions of Vq and Vf are independent of representatives, as w =jr, w' implies Last(w) = 
Last(w'), we have V s = V ( f U Vf due to V = Vq U V\, and T 7 is well-defined, since every equivalence class 
in Plays <3 j = 9 is also one in Plays <3 / =jr . Finally, let /(v) = [v]=^. for every v G V. 

Remark 1. If([w]=p , ) & E s , then we hove (Last (w), Last (V)) G E. 

For the sake of readability, we denote = ^ -equivalence classes by [w] from now on. All definitions 
and statements below are independent of representatives and we refrain from mentioning it from now on. 

Example 3. To illustrate these definitions, Figure\2\depicts the safety game &sfo r the Muller game & 
from Example^ One can verify easily that the vertices [v]for v G V are in the winning region of Player 0. 
This corresponds to the fact that Player 0's winning region in the Muller game contains every vertex. 

The proof of Theorem Q]is split into several lemmata. Due to determinacy of both games, it suffices 
to consider only one Player (we pick i = 0) to prove Theorem lllll 

To win the safety game, we simulate a winning strategy for the Muller game that bounds Player 1 's 
scores by 2, which suffices to avoid the vertices in V s \F, which encode that a score of three is reached. 

Lemma 3. For every v G V: ifv G Wo(^) then [v] G Wq(%). 

2 Hence, every vertex in Plays =3 is terminal, contrary to our requirements on an arena. However, every play visiting these 
vertices is losing for Player no matter how it is continued. To simplify the following proofs, we refrain from defining outgoing 
edges for these vertices. 
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For the other direction of Theorem lllll we show that a subset of Wo(&s) can be turned into a memory 
structure for Player in the Muller game that induces a winning strategy. We use the =jr 1 -equivalence 
class of w as memory state to keep track of Player l's scores in <$ . But instead of using all equivalence 
classes in the winning region of Player 0, it suffices to consider the maximal ones with respect to <^ x 
that are reachable via a fixed positional winning strategy for her in the safety game. Formally, we have 
to lift <jf, to equivalence classes by defining [w] <jr, \w'\ if and only if w <jr, w' . 

The following proof is similar to the reductions from co-Biichi ifTOl fT6l l20l and parity games (H to 
safety games, but for the more general case of Muller games. We come back to the similarities to the 
latter reduction when we want to determine permissive strategies in the next section. 

Lemma 4. For all v G V: if [v] G W Q (& S ) then v G W Q (9). 

Proof. Let a be a uniform positional winning strategy for Player in ^ s and let R C V s be the set of 
vertices which are reachable from Wo(&s) H {[v] | v G V} by plays consistent with a. Every [w] eRC\Vq 
has exactly one successor in R (which is of the form [wv] for some v G V) and dually, every successor of 
[w] G R PI (which are exactly the classes [wv] with (Last(w) , v) G E) is in R. Now, let R max be the set 
of <jf] -maximal elements of R. Applying the facts about successors of vertices in R stated above, we 
obtain the following remark. 

Remark 2. Let R max be defined as above. 

1. For every [w] G R ma x H Vq, there is a v £V with (Last(w),v) G E and there is a \w'\ G /? max such 
that [wv] <jr, [w'\. 

2. For every [w] G R max H and each of its successors [wv], there is a [w 1 ] G /? m ax such that [wv] < 

k;. 

Thus, instead of updating the memory from [w] to [wv] (and thereby keeping track of the exact 
scores) when processing a vertex v, we can directly update it to a maximal element that is -larger 
than [wv] (and thereby over-approximate the exact scores). Formally, we define 9Jt = (M,Init,Upd) by 



This implies [w] <j?j Upd*(w) for every w G V + with Upd*(w) ^ JL. Thus, Last(w) = Last(w'), where 
[w'] = Upd*(w). Using Remark|2j we define the next-move function 



{v' if Last(w) = v, (v,v') G E, and there exists [w'] G R max such that [wv'] < ^ [w 1 ], 
v" otherwise (where v" is some vertex with (v,v") G E), 



and Nxt(v, _L) = v" for some v" with (v,v") G The second case in the case distinction above is just to 
match the formal definition of a next-move function; it is never invoked due to Last(w) = Last(w') for 
Upd*(w) = [w 1 ] or Upd*(w) = _L. 

3 We use the memory state ± to simplify our proof. It is not reachable via plays that are consistent with the implemented 
strategy and can therefore be eliminated. 



M = R max U{±}E 




if [v] G Wb(^s) and there exists [w] G R max with [v] <jr, [w], 
otherwise, 




[w'] if there is some [w'] G R max such that [wv] <jr, [w'] 
± otherwise. 
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It remains to show that the strategy a implemented by Wl and Nxt is a winning strategy for Player 
from W = {v | [v] G VVb(^s)}- An inductive application of Remark [2] shows that every play w that starts 
in W and is consistent with a satisfies Upd*(w) ^ _L. This bounds the scores of Player 1 by 2, as we 
have [w] <j?, Upd*(w) G R max C Plays <3 for every such play. Hence, d is indeed a winning strategy for 
Player from W. □ 

Using Lemma [3] and the construction in the proof of Lemma [4] proves Theorem 1 1121 
Corollary 1. Player has a finite-state winning strategy from Wq{^) whose memory states form an 
<g; x -antichain in Wq(^s)- 

To finish the proof of Theorem [H we determine the size of to prove the third statement. To this 
end, we use the concept of a latest appearance record (LAR) ifTTlfrTll . Note that we do not need a hit 
position for our purposes. A word I G V + is an LAR if every vertex v G V appears at most once in I. 
Next, we map each w G V + to a unique LAR, denoted by LAR(w), as follows: LAR(v) = v for every 
v G V and for w G V + and v G V we define LAR(wv) = LAR(w)v if v ^ Occ(w) and LAR(wv) = p\P2V 
if LAR(w) = p\vp2- A simple induction shows that LAR(w) is indeed an LAR, which also ensures that 
the decomposition of w in the second case of the inductive definition is unique. We continue by showing 
that LAR(vv) determines all but |LAR(w)| many of w's scores and accumulators. 
Lemma 5. Let w G V + and LAR(w) = v^v^-x ■ • • vi. 

1. w = XkVkXk-iVk-i ■ ■ -X2V2X\V\for some Xi G V* with Occ(^) C {vi, . . . , v,}/or every i. 

2. Scp(w) > if and only ifF = {vi, . . . , Vj} for some i. 

3. If Sc f(w) = 0, then Acc/r(w) = {vi,...,Vf} for the maximal i such that {vi,...,v,} C F and 
Accp (w) =d)ifno such i exists. 

4. LetSc F (w) >0andF = {vi,...,v,}. Then, Acc F (w) G {0} U {{vi, . . . ,v 7 } | j < i}. 
This characterization allows us to bound the size of and to prove Theorem 1 1131 

Lemma 6. We have \V S \ < (ij^ (^) -k\-2 k -k\\ +1 < (|V|!) 3 . 

Proof. In every safety game, we can merge the vertices in V \ F to a single vertex without changing 
W (^). Since [v] G F, we also retain the equivalence v G W*(5f ) 44> [v] G Wi(W s ). 

Hence, it remains to bound the index of Plays <3 1 = Tx ■ Lemma [5] shows that a play prefix w G V + 
has |LAR(w) | many sets with non-zero score. Furthermore, the accumulator of the sets with score 
is determined by LAR(w). Now, consider a play w G Plays <3 and a set F G with non-zero score. 
We have Scf (w) G {1,2} and there are exactly \F\ possible values for Acc^ (w) due to Lemma 15141 
Finally, LAR(w) = LAR(w') implies Last(vv) = Last(w'). Hence, the index of Plays <3 1=# is bounded 
by the number of LARs, which is ££=l if) tmies the number of possible score and accumulator 
combinations for each LAR £ of length k, which is bounded by 2 k ■ k\. □ 

In the proof of Theorem lll2[ we used the maximal elements of Player O's winning region of the safety 
game that are reachable via a fixed winning strategy. It is the choice of this strategy that determines the 
size of our memory structure. However, finding a winning strategy that visits at most k G N vertices in an 
arena (from a fixed initial vertex) for a given k is NP-complete. This can be shown by a reduction from 
the vertex cover problem (compare, e.g., |H1 where a more general result is shown). Moreover, it is not 
even clear that a small strategy also yields few maximal elements. 

In general, a player cannot prevent her opponent from reaching a score of 2, but there are arenas in 
which she can do so. By first constructing the subgame ^ up to threshold 2 (which is smaller than ^5), 
we can possibly determine a subset of Player O's winning region faster and obtain a (potentially) smaller 
finite-state winning strategy for this subset. But Example [2] shows that this approach is not complete. 
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5 Permissive Strategies for Muller Games 

Bemet et al. introduced the concept of permissive strategies for parity garnet HL a non-deterministic 
winning strategy that subsumes the behavior of every positional (non-deterministic) winning strategy. To 
compute such a strategy, they reduce a parity game to a safety game. The main observation underlying 
their reduction is the following: if we denote the number of vertices of priority c by n c , then a (non- 
deterministic) positional winning strategy for Player does not allow a play in which an odd priority c is 
visited n c + 1 times without visiting a smaller priority in between. This property can be formulated using 
scoring functions for parity games as well. The scoring function for a priority c counts the occurrences 
of c since the last occurrence of a smaller priority (such an occurrence resets the score for c to 0). Hence, 
our work on Muller games can be seen as a generalization of Bernet et al.'s work. While the bound n c on 
the scores in a parity game is straightforward, the bound 2 for Muller games is far from obvious. 

Since both constructions are very similar, it is natural to ask whether we can use the concept of 
permissive strategies for Muller games. In parity games, we ask for a non-deterministic strategy that 
subsumes the behavior of every positional strategy. As positional strategies do not suffice to win Muller 
games, we have to give a new definition of permissiveness for such games. In other words, we need to 
specify the strategies whose behaviors a permissive strategies for a Muller game should subsume. One 
way to do this is to fix a sufficiently large bound M and to require that a permissive strategy for a Muller 
games subsumes the behavior of every finite-state winning strategy of size at most M (this was already 
proposed by Bernet et al. for parity games HI). 

However, we prefer to take a different approach. By closely inspecting the reduction of parity to 
safety games, it becomes apparent that the induced strategy does not only subsume the behavior of every 
positional winning strategy, but rather the behavior of every strategy that prevents the opponent from 
reaching a score of n c + 1 for some odd priority c (in terms of scoring functions for parity games). It is this 
formulation that we extend to Muller games: a (non-deterministic) winning strategy for a Muller game 
is permissive, if it subsumes the behavior of every (non-deterministic) winning strategy that prevents the 
losing player from reaching a score of 3. We formalize this notion in the following and show how to 
compute such strategies from the safety game constructed in the previous section. 

A multi-strategy for Player i in an arena (V,Vo,V\,E) is a mapping a: V*Vi — > 2 V \ {0} such that 
v' G a(wv) implies (v, v') € E. A play p is consistent with a if p n+ \ S ci(po • • -p„) for every n such that 
p n £ Vj. We still denote the plays starting in a vertex v that are consistent with a multi-strategy a by 
Beh i3 /(v, a) and define Beh i£ /(W, a) = [jy^Beh^^v, a) for every subset W C V. A multi-strategy a is 
winning for Player from a set of vertices W in a game {stf , Win) if Beh £ /(W, o) C Win, and a multi- 
strategy x is winning for Player 1 from W, if Beh £ /(W, a) C V w \Win. It is clear that the winning regions 
of a game do not change when we allow multi-strategies instead of standard strategies. 

To define finite-state multi-strategies we have to allow a next-move function to return more than one 
vertex, i.e., we have Nxt: Vj x M ^ 2 V \ {0} such that V € Nxt(v,m) implies (v,v') € E. A memory 
structure Wl and Nxt implement a multi-strategy a via a(wv) = Nxt(v,Upd*(wv)). 

Definition 3. A multi-strategy o' for a Muller game £f is permissive, if 

1. o' is a winning strategy from every vertex in Wq(^), and 

2. Beha/(v, a) C Beh iC /(v, o')for every multi-strategy a and every vertex v with MaxSc & x (p) < 2 for 
every p € Beh^(v, a). 



4 A parity game consists of an arena srf and a priority function £2: V — > N. A play p is winning for Player if the 

minimal priority that is seen infinitely often during the play is even. 
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The original definition for parity games replaces the second condition by the following requirement: 
Behaf (v, a) C Beh^(v, a') for every positional multi-strategy a and every v from which a is winning. 

Example 4. Once again consider the Muller game of Example \2\ Starting at vertex 1, moving to is 
consistent with a winning strategy for Player that bounds Player 1 's scores by 2. Similarly, moving to 
2 is also consistent with a winning strategy for Player that bounds Player 1 's scores. Hence, we have 
(7'(1) = {0,2} for every permissive strategy O 1 . Now consider the play prefix 10. Here it is Player I's 
turn and he can use the self-loop either infinitely often (which yields a play that is winning for Player 0) 
or only finitely often (say n times) before moving back to vertex 1. In this situation, i.e., with play prefix 
10" +1 1, a strategy that bounds Player I's scores by 2 has to move to vertex 2. Hence, we must have 
a'(10" +1 1) D {2}. However, it is possible that we also have 1 G a / (10" +1 1), since a permissive strategy 
may allow more plays than the ones of strategies that bound Player 1 's scores by 2. However, at some 
point, a' has to disallow the move back to vertex 0, otherwise it would allow a play that is losing for her. 

Using the safety game defined in the previous section, we are able to show that Player always 
has a finite-state permissive strategy and how to compute one. 

Theorem 2. Let be a Muller game and <S$ the corresponding safety game as above. Then, Player 
has a finite-state permissive strategy for with memory states Wo(@s)- 

The proof is very similar to the one for Theorem II 121 (cf. the construction in the proof of Lemma 0]), 
but we have to use all vertices in Wq(@s) as memory states to implement a permissive strategy, only using 
the maximal ones (restricted to those reachable by some fixed winning strategy for the safety games) does 
not suffice. Furthermore, the next-move function does not return one successor that guarantees a memory 
update to a state from Wq(&s)> but it returns all such states. 

Proof. We define Wl = (M,Init,Upd) where M = W (& s ) U {-L}0, 



Init(v) 



[v] if [v]eW (Sfs), 
_L otherwise, 



u P d(M,v) 




if H G W (&s), 
otherwise. 



Hence, we have Upd*(w) = [w] G Wo(@s) as l° n g as every prefix x of w satisfies [x] G Wo(&s)* an d 
Upd*(w) = _L otherwise. We define Nxt by Nxt(v, _L) = {V} for some successor v' of v and 



Nxt(v, [w]) 



{v' | [wv'\ G W (&s)} if M e W (&s) and Last(w) = v, 

{v"} otherwise, where v" is some successor of v. 



Since every vertex in Wo(&s) H Vq has at least one successor in Wb(^s), the next-move function always 
returns non-empty set of successors of v in <S . 

It remains to show that the strategy a' implemented by Wl and Nxt is permissive. We begin by 
showing that a is winning from every vertex v G W&): due to Lemma|3l we have [v] G Wb(Sfs). Hence, 
the memory is initialized with [v] G Wo(^s). A simple induction shows Upd*(w) = [w] G Wb(^s) for 
every play prefix that starts in [v] is consistent with a'. This bounds Player I's scores by 2. Hence, a' is 
indeed winning from v. 



5 Again, we use the memory state _L to simplify our proof. It is not reachable via plays that are consistent with the strategy 
implemented by 9Jt and can therefore eliminated and its incoming transitions can be redefined arbitrarily. 
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Finally, consider a multi-strategy a and a vertex v such that MaxScjr^p) < 2 for every play p G 
Behj/ (v, <j). We have to show that every play p G Beh c /(v, a) is consistent with a'. Since a is winning 
from v (as it bounds Player l's scores), we have v G Wo(&). Now, assume p is not consistent with a' 
and let po • • • p n Pn+\ be the shortest prefix such that p„ + i ^ cr'(po • • • p„). Then, we have [po • • • p«p,i+i] ^ 
Wo(Sfs)- Hence, Player 1 has a strategy to enforce a visit to V s \F in 5fj starting in \p$ ■ ■ ■ p n p n+ \]. 
Player 1 can mimic this strategy in to enforce a score of 3 against every strategy of Player when 
starting with the play prefix po • • • p n Pn+i- Since this prefix is consistent with a, which we have assumed 
to bound Player l's scores by 2, we have derived the desired contradiction. □ 

6 Safety Reductions for Infinite Games 

It is well-known that classical game reductions are not able to reduce Muller games to safety games, 
since they induce continuous functions mapping (winning) plays of the original game to (winning) plays 
of the reduced game. The existence of such functions is tied to topological properties of the sets of 
winning plays in both games. However, we transformed a Muller game to a safety game which allowed 
us to determine the winning regions and a winning strategy for one player. This is possible, since our 
reduction does not induce a continuous function: a play is stopped as soon as Player 1 reaches a score 
of 3, but it can (in general) be extended to be winning for Player 0. 

In this section, we briefly discuss the reason why Muller games can not be reduced to safety games 
in the classical sense, and then we present a novel type of game reduction that allows us to reduce many 
games known from the literature to safety games. The advantage of this safety reduction is that the 
reduced game is always a safety game. Hence, we can determine the winning regions of various games 
from different levels of the Borel hierarchy using the same technique. However, we only obtain a winning 
strategy for one player, and to give such a reduction, we need to have some information on the type of 
winning strategies a player has in such a game. Let us begin by discussing classical game reductions. 

An arena = (V,Vo,V\,E) and a memory structure 9JT = (M,Init,Upd) for srf induce the expanded 
arena st x Wl = (V x M, V x M, Vi x M,E') where ((s,m), (v 1 ,m')) G E' if and only if (v,v') € E and 
Upd(ra,v') = m' . Every play p in srf has a unique extended play p' = (p ,rao)(pi ,m{)(p2,m2) ■ ■ ■ in 
stf x Wl defined by mo = Init(po) and m n+ \ = Upd(m„,p„ + i), i.e., m n = Upd*(po • ■ • p n ). A game = 
(£?, Win) is reducible to ' = {sf\ Win') via SOT, written ( S < m <£' , if s4' = £/ x 9JT and every play p in 
is won by the player who wins the extended play p' in W, i.e., p G Win if and only if p' G Win'. 

Lemma 7. Let 5f be a game with vertex set V and W C V. If@ <<jji ( S' and Player i has a positional 
winning strategy for C S' from {(v, Init(v)) | v G W}, then she also has a finite-state winning strategy with 
memory %Rfor from W. 

The set Win^ Q V m of winning plays of a Muller game is in general on a higher level of the Borel 
hierarchy than the set Wins Q U m of winning plays of a safety game. Hence, in general, there exists no 
continuous (in the Cantor topology) function /: V m — ► U m such that p G Win^ if and only if f(p) G 
Wins ( see > e.g., |[T5l ). Since the mapping from a play in s4 to its extended play in s$ x 9Jt is continuous, 
we obtain the following negative result (which holds for other pairs of games as well). 

Corollary 2. In general, Muller games cannot he reduced to safety games. 

To overcome this, we present a novel type of game reduction which encompasses the construction 
presented in Section [4] and is applicable to many other infinite games. 
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Definition 4. A game = (&f, Win) with vertex set V is (finite-state) safety reducible, if there is a regular 
language L C V* of finite words such that: 

• For every play p G V a : z/Pref(p) C L, f/jerc p G Win. 

• If v G Wq(W), then Player has a strategy a such that Pref(Beh(v, a)) C L. 

Note that a strategy a satisfying Pref(Beh(v, a)) C L is winning for Player from v. Many games 
appearing in the literature on infinite games are safety reducible, although these reductions do neither 
yield fast solution algorithms nor optimal memory structures. 

• In a Biichi game <£ , Player has a positional winning strategy such that every consistent play visits 
a vertex in F at least every k = \V\F\ steps. Hence, is safety reducible with L = Pref(((V \ 
F)- k ■ F) a ). 

• In a co-Biichi game , Player has a positional winning strategy such that every consistent play 
stays in F after visiting each vertex in V \ F at most once. Hence, <3 is safety reducible with 
L = Pref({w • F m | each v € V \F appears at most once in w}). 

• In a request-response game W, Player has a finite-state winning strategy such that in every con- 
sistent play every request is answered within k = \V\ ■ r ■ 2 r+l steps, where r is the number of 
request-response pairs ll2~T1l . Hence, <S is safety reducible to the language of prefixes of plays in 
which every request is answered within k steps. 

• In a parity game, Player has a positional winning strategy such that every consistent play does 
not visit n c + 1 vertices with an odd priority c without visiting a smaller even priority in between, 
where n c is the number of vertices with priority c. Hence, <S is safety reducible to the language of 
prefixes of plays satisfying this condition for every odd priority c. 

• Lemma [2] shows that a Muller game is safety reducible to the language of prefixes of plays that 
never reach a score of 3 for Player 1 . 

Let 8$ = (V,Vo,V\,E) be an arena and let 21 = (Q,V,qQ,8,F) be a deterministic finite automaton 
recognizing a language over V. We define the arena si X 21 = (V X Q,Vo X Q,Vi X Q,E o 5) where 
((v,q),(V,q')) 6 E o 8 if and only (v,v') £E and 8{q,v')=q'. 

Theorem 3. Let be a game with vertex set V that is safety reducible with language L(2l) for some 
DFA 21 = (Q,V,q ,8,F). Define the safety game <£' = {st x 21, V x F). 

1. v£ W (?) if and only if (v,8(q ,v)) G W (&'). 

2. Player has a finite-state winning strategy from Wq(&) with memory states Q. 

It is easy to show that every game in which Player has a finite-state winning strategy is safety- 
reducible to the prefixes of plays consistent with this strategy, which is a regular language. However, for 
this construction, we need a finite-state winning strategy, i.e., there is no need for a safety reduction. 

If £f is determined, then Theorem 131 H is equivalent to v G W/(Sf ) if and only if (v,8(qo,v)) G W|-(£f '). 
Hence, all games discussed above can be solved by solving safety games. We conclude by mentioning 
that safety reducibility of parity games was used implicitly to construct an algorithm for parity games lfl4l 
and to compute permissive strategies for parity games [1]. Furthermore, the safety reducibility of co- 
Biichi games is used implicitly in work on bounded synthesis [20] and LTL realizability [ 10], so-called 
"Safraless" constructions |[T6l which the do not rely on determinization of automata on infinite words. 

Furthermore, the new notion of reduction allows to generalize permissiveness to all games discussed 
in Example [3j if a game is safety reducible to L, then we can construct a multi-strategy that allows every 
play p in which Player 1 cannot leave L starting from any prefix of p. Thereby, we obtain what one could 
call L-permissive strategies. For example, this allows to construct the most general non-deterministic 
winning strategy in a request-response game that guarantees a fixed bound on the waiting times. 
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7 Conclusion 

We have shown how to translate a Muller game into a safety game to determine both winning regions 
and a finite-state winning strategy for one player. Then, we generalized this construction to a new type 
of reduction from infinite games to safety games with the same properties. We exhibited several implicit 
applications of this reduction in the literature as well as several new ones. Our reduction from Muller 
games to safety games is implemented in the tool GAVS-t|^| [6]. In the future, we want to compare the 
performance of our solution to classical reductions to parity games as well as to direct algorithms. 

Our construction is based on the notion of scoring functions for Muller games. Considering the 
maximal score the opponent can achieve against a strategy leads to a hierarchy of all finite-state strategies 
for a given game. Previous work has shown that the third level of this hierarchy is always non-empty, 
and there are games in which the second level is empty. Currently, there is no non-trivial characterization 
of the games whose first or second level of the hierarchy is non-empty, respectively. 

The quality of a strategy can be measured by its level in the hierarchy. We conjecture that there is 
always a finite-state winning strategy of minimal size in the least non-empty level of this hierarchy, i.e., 
there is no tradeoff between size and quality of a strategy. This tradeoff may arise in many other games 
for which a quality measure is defined. Also, a positive resolution of the conjecture would decrease the 
search space for a smallest finite-state strategy. 

We used scores to construct a novel antichain-based memory structure for Muller games. The an- 
tichain is induced by a winning strategy for the safety game. It is open how the choice of such a strategy 
influences the size of the memory structure and how heuristic approaches to computing winning strate- 
gies that only visit a small part of the arena |fl9l influence the performance of our reduction. 

Finally, there is a tight connection between permissive strategies, progress measure algorithms, and 
safety reductions for parity games: the progress measure algorithm due to Jurdziriski lfl4l and the re- 
duction from parity games to safety game due to Bernet et al. HI to compute permissive strategies are 
essentially the same. Whether the safety reducibility of Muller games can be turned into a progress 
measure algorithm is subject to ongoing research. 
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